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ABSTRACT 



We describe application of the 'shapelet' linear decomposition of galaxy images to morpho- 
logical classification using images of ~ 3000 galaxies from the Sloan Digital Sky Survey. After 
decomposing the galaxies we perform a principal component analysis to reduce the number of 
dimensions of the shapelet space to nine. We find that each of these nine principal components 
contains unique morphological information, and give a description of each principal component's 
contribution to a galaxy's morphology. We find that galaxies of differing Hubble type separate 
cleanly in the shapelet space. We apply a Gaussian mixture model to the 9-dimensional space 
spanned by the principal components and use the results as a basis for classification. Using the 
mixture model, we separate galaxies into seven classes and give a description of each class's phys- 
ical and morphological properties. We find that several of the mixture model classes correlate 
well with the traditional Hubble types both in their morphology and their physical parame- 
ters (e.g., color, velocity dispersions, etc.). In addition, we find an additional class of late-type 
morphology but with high velocity dispersions and very blue color; most of these galaxies ex- 
hibit post-starburst activity. This method provides an objective and quantitative alternative to 
traditional and subjective visual classification. 

Subject headings: methods : data analysis — methods : statistical — techniques : image processing — 
galaxies : fundamental parameters — galaxies : statistics 

1. INTRODUCTION robustly defined, and it relies on manual classifica- 



The first widely accepted scheme for morpho- 
logical classification of galaxies was the Hubble 
tuning fork (Hubble f936). Although Hubble's 
system has proven useful as a framework for dis- 
cussion, and has provided insights into the re- 
lationships between galaxy morphology, galactic 
evolution, stellar populations, and other physical 
quantities (van den Bergh 1998; Elmegreen f998), 
it is lacking in several respects. Most importantly, 
the Hubble scheme is subjective in the sense that 
morphological differences between classes are not 



tion by experts. Also, there are many galaxies that 
do not fit into the Hubble sequence, such as enor- 
mous cD galaxies (Matthews, Morgan, & Schmidt 
f964; Morgan & Lesh 1965), H I gas-poor anemic 
galaxies (van den Bergh 1976), and dusty amor- 
phous galaxies (Sandage & Brucato 1979). Often 
such galaxies are 'pigeon-holed' into inappropri- 
ate classifications, resulting from the Hubble sys- 
tem's lack of flexibility and use of discrete classes. 
In addition, many low luminosity dwarf galaxies 
do not easily fit into Hubble's scheme (Elmegreen 
1998), nor do those known as peculiar galaxies 
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(Arp 1966). Hubble's classification scheme does 
not provide an adequate framework for high red- 
shift (z > 0.3) galaxies (van den Bergh, Cohen, & 
Crabbe 2001; Abraham et al. 1996a) or those in 
the cores of rich clusters (Koopmann & Kenney 
1998; Abraham et al. 1994). This is not surpris- 
ing, as Hubble based his system on the morpholo- 
gies of galaxies in the nearby universe. Galaxies at 
high redshift are seen at significantly earlier stages 
in their evolution, and are more likely to undergo 
mergers and collisions that distort their morphol- 
ogy. In addition, the elliptical sequence EO-E7 of 
the tuning fork is heavily dependent on the orien- 
tation of the galaxy, rather than its intrinsic mor- 
phology. 

Motivated by the ambiguities in Hubble's clas- 
sification framework, there have been many ad- 
ditions and modifications to it. In an attempt 
to make the Hubble sequence more complete, de 
Vaucoulcurs (1959) introduced finer divisions re- 
garding barred spirals and differentiation between 
those galaxies showing rings and those that do not. 
However, the addition of these parameters has 
not been conclusively tied to physically significant 
differences among galaxies (Kormendy 1982). A 
twelve-stage classification sequence for spiral arms 
was developed by Elmegreen & Elmcgreen (1982), 
where the classifications range from patchy 'floc- 
culent' arms to well defined 'grand design' arms. 
It has been shown that arm type and Hubble type 
appear to be unrelated (van den Bergh 1998). The 
addition of luminosity classes modeled after stel- 
lar luminosity classifications was given by van den 
Bergh (1960). A simpler system of galaxy classifi- 
cation based on the central concentration of light 
was developed by Morgan (1958), and has proven 
to be useful for classifying galaxies at large red- 
shifts and in the cores of rich clusters (van den 
Bergh 1998). 

Most of the recent work on galaxy classification 
has been in the pursuit of an automated and ob- 
jective classification system. It has been shown 
that manual classification of images with limited 
dynamical range and poor resolution leads to a dis- 
appointingly large difference among human classi- 
fiers (Nairn et al. 1995; Abraham et al. 1996b). 
This discrepancy is most pronounced at high red- 
shifts, where the traditional Hubble sequence be- 
gins to break down. Quantitative conclusions of 
how a galaxy's morphology is related to its phys- 



ical parameters and evolution have been hindered 
by the qualitative and environment-dependent na- 
ture of the Hubble classifications. Many physical 
properties of galaxies have been shown to be cor- 
related (e.g., Blanton et al. 2003a) and one would 
desire a quantitative description of how a galaxy's 
morphology fits in; i.e., a morphological classifi- 
cation scheme should probe the important physi- 
cal properties of galaxies. In addition, the advent 
of enormous astronomical databases necessitates 
automatic classification. For example, the Sloan 
Digital Sky Survey (SDSS, York et al. 2000) is 
expected to catalog the spectra for roughly one 
million galaxies, making manual classification im- 
practical. 

Many methods of automatic and objective clas- 
sification have been proposed. A method which 
utilizes the central concentration index of a galaxy 
was proposed by Doi, Fukugita, & Okamura 
(1993), and Abraham et al. (1994, 1996b) have 
developed a promising system based on a galaxy's 
central concentration of light and asymmetry. 
Conselice (2003) has taken this a step further 
through the addition of a 'dumpiness' parameter, 
and relates the central concentration, asymme- 
try, and dumpiness parameters to the underlying 
physical processes and history. Much of the work 
toward automatic galaxy classification has been in 
the use of artificial neural networks (e.g., Goderya 
& Lolling 2002; Odewahn 1995)). Neural networks 
are powerful learning tools, where the central idea 
is to extract linear combinations of the inputs as 
derived features, and then model the target as a 
nonlinear function of these features (Hastie, Tib- 
shirani, & Friedman 2001). Neural networks have 
met with approximately the same success as vi- 
sual classification (e.g., Ball et al. 2003), but have 
relied on the training set for classification; neural 
networks can replicate visual classification, but 
not create a classification framework. Additional 
techniques include principal component analysis 
(Frei 1999) and fractal dimensions (Thanki, Rhcc, 
& Lepp 2000). Those classification schemes that 
are able to separate galaxies cleanly often utilize 
only ~ 100 — 200 galaxies, resulting in a fairly 
small number of galaxies in each class and mak- 
ing comparison difficult. In addition, a complete 
quantitative description of galaxy morphology is 
still lacking, as most systems utilize only two or 
three parameters (e.g., Doi, Fukugita, & Okamura 



2 



1993; Abraham & Merrifield 2000). Techniques 
involving Fourier analysis also hold promise (Ode- 
wahn et al. 2002; Trinidad 1998), but rely on 
analyzing azimuthal profiles of a galaxy, rather 
than analyzing the full 2-dimensional image of the 
galaxy. 

One promising approach toward a quantita- 
tive, complete, and automatic system of morpho- 
logical classification is to linearly decompose a 
galaxy into 'shapelets' (Rcfrcgicr 2003). Shapelets 
are Gaussian-weighted Hermite polynomials which 
happen to be the cigcnstatcs of the quantum har- 
monic oscillator Hamiltonian. The have been 
shown to be useful in weak gravitational lcnsing 
measurements (Chang & Refregier 2002; Refregier 
& Bacon 2003) and for image simulation (Massey 
ct al. 2003). They form an ideal framework in 
which to classify galaxies as they are mathemati- 
cally well defined and understood, compact, form 
an orthonormal basis, and use all of the shape in- 
formation in a galaxy. Shapelets are preferred over 
other techniques that utilize a basis expansion, 
such as wavelets (Farge 1992) or curvelets and 
ridgelets (Starck, Donoho, & Candes 2003). These 
techniques rely on translated and dilated versions 
of a single shape (e.g., a Gaussian wave packet in 
the case of the Morlet wavelet), while shapelets 
form a complete set of varying shapes with the 
same scale. Because the shapelet decomposition 
maps a 2-dimensional galaxy image onto a vector 
in a multi-dimensional space, galaxy classification 
amounts to finding structure in the shapelet coeffi- 
cient space. The shapelet method provides a con- 
tinuous framework to describe galaxy morphology 
and classification, rather than several discrete bins 
as in traditional classification methods. 

Because differences in resolution contribute to 
differences in observed morphology we choose to 
artificially redshift all galaxies and smooth them 
to the same physical scale. This assures that 
galaxies are resolved on the same physical scale, 
rather than on the same angular scale, ensuring 
that the resolution is independent of galaxy dis- 
tance or size. Other techniques have used a similar 
method (Abraham et al. 1996b; Conselice 2003), 
however we do not know of any frameworks that 
measure the morphology of galaxies smoothed to 
the same physical scale. The technique is fully au- 
tomatic, allowing efficient analysis of galaxy im- 
ages. We apply the method to data from the 



SDSS, using both galaxies previously classified ac- 
cording to Hubble type and previously unclassified 
galaxies. 

2. SHAPELETS 

2.1. Cartesian Shapelets 

Much of the shapelet formalism can be found 
in Refregier (2003). We first construct the dimen- 
sionless basis function in 1-dimcnsion: 



4>n{x) 



2"7r5n! 



H n (x)e i 



(1) 



where n is a non- negative integer and H n (x) a 
Hermite polynomial of order n. In practice, we use 
the dimensional basis functions, which are easily 
described from the dimensionless set by 



B n (x;j) =7 s</>„(7 1 x) 



(2) 



where 7 is the characteristic scale of the object to 
be analyzed. These functions are orthonormal, i.e. 



B n (x; j)B m (x; j)dx = S n 



(3) 



Because the Cartesian shapelets are separable 
in x, we can easily construct the 2-dimensional 
basis functions from the 1-dimcnsional: 



0„(x) EE (j) ni {x 1 )(j) n2 (X2), 



(4) 



where x = (or 1,0:2) and n = (711,712). The dimen- 
sional basis functions become 



i? n (x,7)EE 7 -Vn(7 _1 x), 



(5) 



and are orthonormal. 

These 2-dimcnsional shapelets form a complete 
orthonormal basis for smooth and integrable func- 
tions of two variables, allowing us to decompose 
any well-behaved two-dimensional image /(x) into 
a sum of shapelets as 

00 

/(X)= /n£n(x; 7 ), (6) 

with shapelet coefficients 

/» = y./(x)i? n (x;7)d 2 x, (7) 
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from the orthonormality property. The first sev- 
eral 2-dimensional Cartesian shapelets are shown 
in Figure 1. Dirac notation is a convenient and 
concise way of representing much of the math of 
quantum mechanics, and we will often use it in this 
paper. In Dirac notation, the n th state is denoted 
as \n), and has x-space representation (x\n) 

2.2. Flux and Centroid 



The dimensional polar basis functions are 

A ninr (r,<prf) = 7 _1 Xn,n r (7 _1 >",P) (12) 
and have the orthonormality property: 

'led ~2-n poo 

4>n- J dip J drrA ninr {r,y\i)A n , in , r {r,w\i) = 8 ninl 5 nr< . 



The flux and center of an astronomical object 
are two of the most basic quantities that one mea- 
sures, and computing them is necessary for any 
classification scheme that relies on linearly decom- 
posing a galaxy into basis functions. The total flux 
F = J f(x)d 2 x of the galaxy may be computed 
from the shapelet coefficients as 



F = ^ 1 2(2 ~ ni ~ 



■n 2 )/2 



Til 

ni/2 



n 2 
n 2 /2 



fn 



(8) 

where the sum is over even n\,n 2 and the paren- 
theses signify the binomial coefficient. The cen- 
troid of an object xci = J Xif(x)d 2 x/F is given 

by 

I 2 °dd even 

aoi = ^££(„ 1 + l)i 2 i< a -i— > x 

tli n 2 

l 1 

/ ni + 1 \ 2 / n 2 \2 

[ (ni + l)/2 J [ n 2 /2 j / m „ 2 , (9) 

and similarly for xq 2 - 

2.3. Polar Shapelets 

It is of practical interest to construct shapelets 
separable in the polar coordinates r and ip, as po- 
lar coordinates are more appropriate for describ- 
ing a galaxy's shape. The polar basis functions are 
eigenstates of the QHO Hamiltonian and angular 
momentum simultaneously. One can show that 
in x-space the polar basis functions Xn t ,n r { r , f ) = 
(x\ni,n r ) are given by 

Xn inr {r,*) = [^ ni \n r r^H ninr {r)e- r2 / 2 e^- n ^ : 

(10) 

where H nirir (r) are the 'polar Hermite polynomi- 
als'. The energy eigenvalue n and the angular mo- 
mentum eigenvalue m are related to n; and n r by 

n = n r + ni 

m = n r — ni, m = — n, — n + 2, . . . ,n — 



(13) 

Plots of the first several polar shapelets can also 
be seen in Figure 2. 

We often need to change between Cartesian and 
polar shapelets, and require a relationship that al- 
lows us to do so. One can show that this is given 
in the form of the transformation matrix between 
these two bases, 



ri\\n 2 \ 



S ni +n 2 ,n l+ n r 2 I I ) 5 <+n{W 

From this equation, we see that only those states 
with m + n 2 = ni + n r = n are mixed. 

3. THE DATA 

The SDSS (York et al. 2000) has been pro- 
ducing imaging and spectroscopic surveys of the 
Northern Galactic Cap over ir stcradians. A 
2.5m telescope at the Apache Point Observatory, 
Sunspot, New Mexico, observes the sky in five 
bands (u, g, r, i, z, Fukugita et al. 1996; Hogg, 
Finkbeiner, Schlcgcl, & Gunn 2001; Smith et al. 
2002) between 3000 and 10000 A, using a drift- 
scanning mosaic CCD camera (Gunn 1998), which 
detects objects to a flux limit of r <~ 22.5 mags. 
The survey, when finished, is expected to spectro- 
scopically observe 900,000 galaxies down to ru m w 
17.77 mags (Strauss et al. 2002), 100,000 Lumi- 
nous Red Galaxies (Eisenstein 2001), and 100,000 
quasars (Richards 2002). The spectroscopic fol- 
low up uses two digital spectrographs on the same 
telescope as the imaging camera, and the spectro- 
scopic samples are assigned plates and fibers using 
an algorithm described by Blanton et al. (2003b). 
The astrometric calibration is described in Pier 
et al. (2003). Details of the galaxy survey can be 
found in the galaxy target selection paper (Strauss 
et al. 2002), and other principles of the survey are 



4 



• 




• 




• • 




1 1 



H ® 

10, 2> I 1 ,2> I 2 , 2 > \3.2> 

m m m 



i 



Fig. 1. — Plots of the first several 2-dimensional 
Cartesian shapelets \rii 1 n2). Dark areas are posi- 
tive, light are negative. 



1 4 , -2 > 

m 

13,-1 > 


1 6 , -2 > 

E3 

15 , -1 > 


u 


1 10,-2 > 

□ 




■ • I 


■ 




□ 


□ 


□ 

17, 1 > 


H 




■ • ■ 


■C»l 


E 








1 10 , 2 > 




in 


u 





Fig. 2. Plots of the real part of the first several 
2-dimensional Polar shapelets \n,m). 



described in the Early Data Release (Stoughton et 
al. 2002). 

We use two samples in this analysis. We first 
investigate the shapelet method using the r-band 
data for 556 of the 1482 well-resolved galaxies used 
by Nakamura et al. (2003, hereafter Sample 1) 
to estimate the morphology-dependent luminos- 
ity function. We have chosen this group in order 
to compare the shapelet results with traditional 
Hubble type, as this catalog contains manual clas- 
sifications of Hubble type. The manual classifica- 
tions make no distinction between spirals with bar 
structure and spirals without. Sample 1 contains 
those galaxies of the Nakamura et al. (2003) sam- 
ple that have redshift z < 0.07 and a PSF width of 
less than 2.0 kpc when projected onto the plane of 
the galaxy. This allows us to smooth all galaxies 
of Sample 1 to a constant scale before decompos- 
ing them (see § 4) . These galaxies are included in 
the SDSS Early Data Release. 

We next investigate the shapelet method on the 
r-band images of a volume-limited sample of 3037 
nearby galaxies (hereafter Sample 2) included in 
the SDSS DR1 (Abazajian et al. 2003). These 
galaxies were chosen because they have projected 
PSF widths of less than our desired resolution 
of 2.0 kpc, allowing use to smooth them to this 
scale. They have redshifts z < 0.07 and abso- 
lute magnitudes R < — 19. We made the redshift 
cut at z = 0.07 because projected PSF widths 
become larger than 2.0 kpc for redshifts greater 
than this, and we chose galaxies of absolute mag- 
nitudes R < — 19 to allow a uniform distribution 
of absolute magnitude with redshift. Figure 3 dis- 
plays the r-band PSF, redshift, and r-band abso- 
lute magnitude distributions for this sample. 

4. DECOMPOSITION METHOD US- 
ING SDSS DATA 

We describe here how we decompose galax- 
ies using SDSS data and the shapelet formalism. 
First, we input the atlas image /i(x) for a given 
color band (u,g,r,i, or z), as well as the adaptively 
weighted moments in the horizontal and vertical 
directions, I xx and I yy (Fischer et al. 2000). The 
atlas images are cutouts that include all significant 
pixels around each detected galaxy (Stoughton et 
al. 2002). The atlas images of several galaxies clas- 
sified according to Hubble type are shown in Fig- 
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ure 4. We use the cataloged SDSS data to find 
the image center xc and the position angle 9 pos 
from the horizontal. We rotate the image an an- 
gular distance 9 pos so that each has its major axis 
along the horizontal. We compute the character- 
istic scale 7 of our analyzing shapelets, given by 



7 : 



(15) 



If the scale 7 (in pixels) was found to be less than (3 
pixels, where (3 is the scale of the PSF, then we set 
7 = (3. If necessary we pad the image with blank 
sky out to a radius of IO7 to assure the orthogonal- 
ity of the basis functions. We add artificial noise 
to the padded regions using the recorded value for 
(T s ky, and then subtract the mean of the image. 

To assure uniformity within the sample, we arti- 
ficially rcdshift all galaxies to z = 0.07 and smooth 
them to a resolution that would be seen with a 
PSF 5o( x ) °f standard physical width (3q. Doing so 
keeps the physical scale of resolution for all galax- 
ies in our sample constant, assuring that infor- 
mation that may contribute to classification (i.e., 
separation in shapelet space) is not affected by dif- 
ferences in resolution. Artificial redshifting is ac- 
complished by rebinning the galaxy image so that 
it occupies fewer pixels. We neglect the changes 
in the ratio of the galaxy flux to sky flux, as well 
as the increase in noise (e.g., from the galaxy, sky, 
dark current, etc.), that would be seen at higher 
z. Such changes are negligible because artificially 
redshifting galaxies out to redshift of only z = 0.07 
does not produce any noticeable changes in these 
quantities. Also, we do not correct for the decrease 
in galaxy flux that results from artificially redshift- 
ing the galaxy, since we normalize the image so 
that its total flux is unity, making flux correction 
unnecessary. 

Smoothing a galaxy by a PSF of width [3 
amounts to performing the convolution 



fc'(x) 



x — x)go(x.')d 2 x, 



(16) 



where /i'(x) is the image after convolving with a 
PSF go( x ) 01 standard width /? , and /(x) is the 
galaxy. Because we do not observe /(x) directly, 
but rather h(x) = J /(x — x')g(x)d 2 x, where g(x) 
is the actual PSF, we must develop a way to calcu- 
late /i'(x) from h(x). This may easily be done in 
shapelet space, where convolution is represented 
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Fig. 3. — The distributions of some physical quan- 
tities for Sample 2. 
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Fig. 4. — Selected galaxies from Sample 1, shown 
before artificial redshifting and smoothing (top) 
and after (bottom). 
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by multiplication (sec Rcfrcgicr & Bacon 2003). If 
we approximate the actual PSF g(x) as a Gaussian 
of scale j3, we can compute in x-space the image 
h'(x) smoothed to the standard scale (3$. We do 
this by performing the convolution 

fc'(x) - J h(x - x!)g'{x)d 2 x, (17) 

where g'(x) is a Gaussian of scale /?' = \J (3$ — fl 2 ■ 
If one is using a non-Gaussian smoothing ker- 
nel (PSF) or wishes to compute the h' n after de- 
composition then one must do the computation 
in shapelet space. Although the PSFs from the 
SDSS data are not exactly Gaussian, the error in 
the coefficients h' n introduced from approximat- 
ing them as so is negligible for our purposes. The 
scale of the shapelets used in the decomposition 
is 7' = ^/"f 2 — (3 2 + 0q , where 7' is used to cor- 
rect for the loss of resolution. This assures that 
7' > 0o- 

Using Equations (6) and (7), and our values for 
7' and xc, we now calculate the shapelet coeffi- 
cients by decomposing the galaxy image about its 
center, up to a maximum order n max = n\ + ri2- 
Because centering errors are introduced from the 
rotation, resizing, and smoothing of the image, 
and to assure the integrity of our decomposition, 
we compute the center of the image from Equa- 
tion (9) using these coefficients and redecompose 
the galaxy about this new center; we do this twice. 
We then compute the flux of the decomposed im- 
age from the shapelet coefficients (Eq.[8]) and di- 
vide the coefficients h' n by the flux. This nor- 
malizes the shapelet coefficients such that the flux 
of the image reconstructed from them is equal to 
unity. Finally, we convert the Cartesian shapelet 
coefficients to the polar ones using Equation (14). 
Figure 4 also shows the Hubble-classified galaxies 
after artificial redshifting and smoothing, and Fig- 
ure 5 shows the reconstruction of a galaxy from its 
shapelet coefficients. 

In this analysis we choose the standard physi- 
cal width /3 to be 2.0 kpc. A f3 of 2.0 kpc al- 
lows us to use a large number of galaxies (<~ 3000) 
out to a redshift of z — 0.07 (see Figure 3), re- 
sulting in a sample that is a good representation 
of galaxies that populate the nearby universe. In 
addition, a smoothing scale of /?o = 2.0 kpc al- 
lows the large-scale structures of galaxies to be 
sufficiently well resolved for classification. Small- 




Fig. 5. — A late type galaxy image (right), and the 
image reconstructed from the shapelet coefficients 
/„ (left), for n max = ni + n 2 = 12. The galaxy 
image was decomposed according to the method 
outlined in § 4. Also shown are the horizontal 
(middle) and vertical (bottom) cross-sections of 
the images, taken at half the opposite axis. The 
shapelet method accurately reconstructs the orig- 
inal image. 



scale structures do not result in appreciable differ- 
ences in shapelet coefficients within the context of 
morphological classification, and losing this infor- 
mation does not affect our classification method. 
We choose n max — 12, assuring a sufficiently ac- 
curate reconstruction for even those galaxies with 
the most complicated morphologies, while keeping 
the computation time reasonable. 

5. PRINCIPAL COMPONENT ANALY- 
SIS 

5.1. The Most Powerful Shapelet States 

We decompose the galaxies of Sample 2 accord- 
ing to the procedure outlined in § 4. In order to 
have an understanding of which shapelets consis- 
tently contain the most power, and thus give the 
best description of the most common aspects of 
the morphologies of nearby galaxies (z < 0.07), we 
have calculated the top ten most powerful shapelet 
states for all the galaxies in Sample 2. These states 
are shown in Table 1 for both the Cartesian and 
Polar shapelets, and several of them can be seen 



7 



Table 1: Top Ten Most Powerful Shapelet States. 



Cartcsian a 






Polar" 




Rank State (|/„|//tot) c 


<x„ d 


State 


<l/»l//*o t ) 





5.2. Principal Component Analysis 

Performing the shapelet decomposition for 
, lax = 12 results in 91 coefficients for each 
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4,0) 
2, o) 
0,4) 
6,0) 
2, 2) 
0, 2) 
8.0) 
0,1) 
1,0) 



0.293 
0.047 
0.059 
0.026 
0.022 
0.021 
0.030 
0.018 
0.016 
0.014 



0.076 
0.012 
0.021 
0.011 
0.010 
0.010 
0.020 
0.009 
0.015 
0.012 



WW 

|4,0> 

|2,0) 

2, -2) 

|2,2) 

|8,0) 

16,0) 

ll.-l) 
1 10,0) 

IM) 



0.309 
0.057 
0.050 
0.029 
0.029 
0.021 
0.023 
0.017 
0.016 
0.017 



"Cartesian states are labeled as \n\,n2). 
6 Polar states are labeled as \n,m). 

c Mean ratio of the coefficient power to the total coefficient 
power 

d Standard Deviation in the coefficient power ratio 



in Figures 1 and 2. As expected, the symmetric 
states are the most powerful, with the first order 
asymmetric state the only asymmetric one that 
appears among the top ten most powerful states. 
In the case of the Cartesian states, we see that the 
states with more horizontal structure (rii > n 2 ) 
are more powerful than those with more vertical 
structure (n 2 > n{). This is the result of the way 
in which we reduce the images before decompos- 
ing them, as we rotate the image such that its 
major axis is oriented along the horizontal. The 
fact that the horizontal states are more powerful 
than the vertical is merely a representation of the 
fact that there is more flux and structure along 
a galaxy's major axis than its minor. Also, the 
state |0, 0} (a Gaussian) is the same for both the 
polar and Cartesian representations, and contains 
a little less than one-third of the total power for 
both. 

For the polar shapelets, it is obvious that the 
states with azimuthal symmetry (m = 0) are the 
most powerful, and again this is not surprising. 
This reflects the fact that the galaxies of Sam- 
ple 2 exhibit more radial structure than angular 
structure. The only m ^ states are |2, ±2} and 
1,±1). The former state picks up the ellipticity 
of a galaxy, and the latter state is just the first 
order asymmetry state. 



0.089 galaxy; 91 dimensions in the shapelet coefficient 
0^029 space. Such a high dimensionality presents a sig- 
0.018 nificant barrier to understanding the nature of the 
aol5 galaxy distributions in shapelet space, as well as 
0.017 performing computations in it (e.g., density esti- 
aol2 mations). In order to reduce the dimensionality 
012 o f the space we perform a principal components 
analysis. The principal component transform (also 
called a Karhunen-Loeve transform) of a set of p- 
dimcnsional data is a linear transformation that 
computes the q principal components, for q < p. 
The principal components of a data set are very 
useful for reducing the dimensionality of the data. 
This is because the q principal components pro- 
vide the best rank q linear approximation to the 
data, and provide an orthogonal basis that max- 
imizes the variance along the components; i.e., 
the first principal component is in the direction of 
highest variance in the data, the second principal 
component is in the direction of highest variance 
subject to being orthogonal to the first principal 
component, etc. (Hastie, Tibshirani, & Fried- 
man 2001). In the context of the work done here, 
the principal components are linear combinations 
of shapelet states, and can be constructed using 
Equation (6) to form a 2-dimcnsional image. We 
denote the j th principal component by Pj(x) and 
its corresponding coefficient by aj . 

We apply the Karhunen-Loeve transform on the 
coefficients of the Cartesian states for the galax- 
ies of Sample 2 and find that 98.7 % of the vari- 
ance is contained within the first three principal 
components, with 96.7 % within the first princi- 
pal component. This implies that the information 
in the shapelet decomposition is incredibly redun- 
dant, and allows us to decrease the dimensionality 
of the space to something that is much easier to 
work with. We found that there was no noticeable 
difference between applying the transform to the 
Cartesian or polar coefficients. Although the polar 
coefficients are complex, the majority of the shape 
information is contained within the real part of the 
polar coefficients. This means that performing the 
transform on the real part of the polar coefficients 
gives nearly the same result as applying it on the 
Cartesian coefficients. This is not surprising, as 
the conversion from Cartesian to polar coefficients 
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is itself a linear transform. We choose to apply 
the principal component analysis to the Cartesian 
coefficients for convenience and to ensure that we 
do not lose any information, however insignificant, 
as would happen if we were to use the polar coef- 
ficients and ignore the imaginary part. 

We decomposed the galaxies of Sample 1 in the 
same manner as for Sample 2. We keep those 
galaxies of Sample f with absolute r-band mag- 
nitude R > —19, since we are only using Sample 
1 for reference and these galaxies are not included 
in the principal component transform or the den- 
sity estimation (see § 6.2). We divide the galaxies 
of Sample 1 into four types — early, mid, late, and 
edge-on. The early types include those Hubble- 
classified by (Nakamura et al. 2003) as E-Sa, the 
middle types include Sab-Sbc, and the late types 
include Sc-Im (note that Im galaxies are classi- 
fied as 'irregular'). The edge-on class contains 
those mid and late type galaxies with axis ratios 
b/a < 0.4, where b and a are the minor and ma- 
jor axis, respectively. Only a few of the pj's ex- 
hibited structure or separation of Hubble type for 
the galaxies of Sample 1. We use the first nine 
principal components, which includes those that 
showed Hubble type separation. This allows us 
to work within a 9-dimensional space spanned by 
{Pi , P2 , P3 , P4 , P5 , Pe , Pi , P8 , P9 } , greatly simplifying 
our analysis. It should be noted that not all of 
a galaxy's morphological information is contained 
within these principal components, but rather that 
a sufficient amount of information pertaining to 
the important differences in galaxy morphology is 
contained in them. It is only the information re- 
garding differences in morphology that is useful 
for classification, which is why we use these nine 
principal components. Principal components af- 
ter pg became increasing irregular and likely only 
serve to include small-scale information unique to 
each galaxy. Locations of the galaxies in the 2- 
dimensional slice spanned by the first two princi- 
pal components are shown in Figure 6, the pro- 
jections of the galaxies along the pj are shown in 
Figure 7, and the first nine principal components, 
Pj(x), arc shown in Figure 8. 

5.3. Description of the Principal Compo- 
nents 

Comparing the projections along the principal 
components with the images of the pj allows us 
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Fig. 6. — Locations of the galaxies of Sample 2 
(top left plot) and Sample 1 in the 2-dimensional 
slice spanned by p\ and pi. Using the reference 
galaxies from Sample 1, we are able to see that 
the different Hubble types are well separated in 
this plane. 
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Fig. 7. — Projections of the galaxies of Sample 1 
along the first nine principal components. From 
these plots, one can see that the Hubble types are 
well seperated along a few of the pj . 
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to better understand why the galaxies separate in 
this space in such a manner. The first Principal 
Component, _pi(x), accounts for the vast majority 
of the variance (« 97 %). It is slightly ellipti- 
cal and has mirror symmetry. We have fit the 
profile of pi and find that it is exponential, ex- 
cept near the edge. Comparison with the other 
principal components reveals that the coefficients 
of pi(x), oi, are typically several times larger in 
magnitude than those for the other Pj(x). This 
implies that p\ (x) forms what we may consider to 
be the basic galaxy shape, while the other pj are 
subsequent modifications to it. This is reinforced 
by the fact that the coefficients and the flux of 
pi are always positive, ensuring that galaxies do 
not contain 'holes' of negative flux. The late types 
typically have the least power in this component 
relative to the other principal components, while 
the early types typically have the most. This im- 
plies that 'more' of the morphology of early types 
is contained within pi(x) than for late types, i.e., 
a greater fraction of the total power in the co- 
efficients for early types is contained within this 
principal component. Because early types typi- 
cally have the simplest morphologies, this result is 
not surprising, since p\ certainly contains minimal 
structure. Since a greater fraction of the shapelet 
power is contained within p\ for early types (small 
Oj | ja\ ) , fewer corrections are needed to this shape 
for early types than for later types. In fact, most 
of the corrections to p\ for early types appear to 
be for the purpose of changing the shape of the 
radial light profile. 

In Figure 9 we show the dependence of several 
physical and morphological parameters as a func- 
tion of the coefficients of p\. One can see that 
a galaxy's concentration index, a common mor- 
phological measurement, correlates well with its 
projection along p\. We define the concentration 
index C as 

C = 5.01og 10 (^) (18) 

where rgo and r$o are the radii where the Pet- 
rosian ratio 77 is equal to 0.1 and 0.5, respectively. 
The Petrosian ratio is the ratio of the local sur- 
face brightness at some radius, as averaged over an 
annulus, to the average surface brightness within 
that radius. More concentrated galaxies (such as 
early types) will have higher values of C. In addi- 



tion, the r-band half-light surface brightness /J50 
also appears to be correlated with a\. We define 
/X50 to be the average surface brightness within the 
Petrosian half-light radius (77 = 0.5). The units are 
magnitudes per arcsec 2 . Although there is a sig- 
nificant spread in /_t 50 as a function of <zi , a broad 
linear is apparent; surface brightness increases (de- 
creases in magnitudes) as cq increases. We inter- 
pret this as meaning that galaxies with the sim- 
plest morphologies, at least as seen when resolved 
on the same physical scale, tend to have the high- 
est surface brightness. These galaxies also tend 
to be the most concentrated, and this relationship 
between concentration and surface brightness is in 
accordance with previous studies (Blanton et al. 
2003a). 

We also show the relationship between oq and 
some of a galaxies spectral properties, by plot- 
ting u — r color and spectral eigenclass as func- 
tions of ai. The spectral eigenclass is the nega- 
tive of the mixing angle, 4>kl, between the first 
and second spectral eigencoefficients, where the 
spectral eigencoefficients, ej, are found from a 
principal component analysis of 170,000 SDSS 
spectra (Yip et al. 2003). The mixing angle is 
4>kl = tan _1 (e 2 /ei), and the spectra cover the 
range 3450-8350 A. Red (elliptical) galaxies typi- 
cally have negative spectral eigenclass values, and 
blue (spiral) galaxies typically have positive val- 
ues. The first eigenspectrum is the mean of all 
galaxy spectra, with a continuum similar to a Sb- 
type. The second eigenspectrum is negative below 
<~ 5200 A, positive at longer wavelengths, and con- 
tains the majority of the stellar absorption infor- 
mation. The third eigenspectrum (used in § 6.3) 
is positive below ~ 6000 A, negative at longer 
wavelengths, and is dominated by line emission. 
One can see that less concentrated galaxies with 
more complex morphologies (i.e., smaller oq) have 
a broader range of spectral properties and are of- 
ten bluer than those galaxies with simpler, more 
concentrated shapes, but show a narrower range of 
a\ than the red galaxies. The coefficient of p\ does 
not appear to have a strong dependence on axis ra- 
tio other than the fact that the distribution in the 
projections along p\ narrows with increasing cllip- 
ticity. Relationships between these physical and 
morphological parameters were also examined for 
the other pj, but the structure was not as well 
defined. We do not discuss them because the a, 
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Fig. 8. — Images of the first nine principal compo- 
nents, constructed from their shapelet coefficients. 
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Fig. 9. — Plots of the coefficients for the princi- 
pal components as a function of several physical 
parameters. The correlations between pi and /X50 
and P2 and b/a are obvious. 



are often correlated with one another, making the 
discussion redundant. 

The second principal component, _P2(x), con- 
tains a significant amount of the ellipticity infor- 
mation of a galaxy. As can be seen in Figure 9, 
the ellipticity increases with decreasing coefficient 
of P2 in a well-defined, linear manner. The edge- 
ons separate well along this principal component, 
and have the most powerful coefficients of P2(x); 
in fact, P2(x) resembles very much what we would 
expect an edge-on spiral galaxy to look like, with 
the exception of negative flux and the 'peaks' seen 
in the vertical direction. These 'peaks' serve to 
narrow the galaxy in the vertical direction, since 
the positive values for the pixels in this region 
are multiplied by the a 2 , which are negative for 
edge-ons, and added to the positive values for the 
pixels from aipi(x). The early types also sepa- 
rate very nicely along this component. The early 
types are found to mostly have positive values for 
a 2 . This may seem odd at first, however, inspec- 
tion of pi(x) and P2(x) reveal why this is the case. 
The first principal component is slightly elliptical. 
Early type galaxies of ellipticity the same as that 
of pi should not require any modification from p 2 , 
and thus would have projections along p 2 of nearly 
zero magnitude. However, if the early type has 
ellipticity less than that of p\ , it will need the ad- 
ditional contribution from p 2 to elongate it in the 
vertical direction and narrow it in the horizontal 
direction. In the case where the ellipticity is less 
than that of p\ , the coefficient is negative, and the 
reverse of p 2 (x) is used to modify p\ . This results 
in the two symmetric 'peaks' in the vertical direc- 
tion becoming 'valleys', and thus would widen the 
contribution from p\ , making the galaxy more el- 
liptical. The coefficients of p 2 for the middle and 
late types are distributed preferentially negative, 
but with less magnitude than the a 2 for the edge- 
ons. This implies that, on the average, the late 
and middle types are slightly more elliptical than 
Pi- 

The other principal components, with the ex- 
ception of pi and ps , do not separate the Hubble 
types as well as the first two, nor do the distri- 
butions appear to have as much structure. The 
third and eighth principal components have light 
profiles that are more centrally concentrated than 
the previous two, as well as alternating annuli of 
negative and positive flux around the central peak. 
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Fig. 10. — Light profile of pi(x) before (left col- 
umn) and after addition (middle column) and sub- 
traction (right column) of selected principal com- 
ponents. Values of |aj|/ai used were chosen to be 
large enough to emphasize the contributions from 
the pj, but still characteristic of real galaxies. The 
vertical axis is shown logarithmically. 



Late types typically have positive values for a§, 
while early types have negative values. This mod- 
ification for late types results in a narrowing of 
the radial profile near the center and a widening 
farther out. For early types, ps flattens the light 
profile near the center and makes it steeper farther 
out. The effect of p^ is similar, with the exception 
that it does not contain as much radial structure 
as ps- The contributions of these two principal 
components help to define the shape of a galaxy's 
light profile, and in the case of late types may 
contain morphological information regarding the 
separation of the bulge and disk components. The 
light profile of pi (x) , as well as the result after the 
addition of P3(x) and ps( x ) for both positive and 
negative cij are shown in Figure 10. 

The fourth, fifth and sixth principal compo- 
nents contain information regarding asymmetry 
about the major (pq) and minor axis (p4 and ^5). 
The fourth and fifth principal components appear 
to have odd symmetry with each other about the 
major axis, i.e., Pi(— x) = — P5(x), and also con- 
tain further information regarding ellipticity. In 
spite of the similarities in the morphologies of P4 
and p 5 , their coefficients do not appear to be cor- 
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related. The coefficients of ^4,^5, and p§ may be 
thought of as giving a weighted asymmetry mea- 
surement. This is because the a,j are given by the 
usual inner product as aj — (pj(x)|/(x)), which is 
equal to the sum of the values of the pixels of our 
galaxy image weighted by the value of the corre- 
sponding pixel of the pj image. In this case, pref- 
erence is given to asymmetry close to the center, 
where the principal components have the highest 
flux. Because of this, these principal components 
do not do a good job of picking out galaxies where 
the asymmetry is far from the center and of lower 
surface brightness. The coefficients of the galax- 
ies for these pj appear to be symmetrically dis- 
tributed around zero regardless of Hubble type, 
as would be expected. The light profiles after the 
contributions from these principal components are 
also shown in Figure 10. 

Hubble types appear to have a small separation 
in pi, with the early and edge-on galaxies having 
negative a-j and the middle and late type galaxies 
having positive 0,7. The morphological contribu- 
tion from p-j appears to be small; for galaxies with 
positive 07 it may serve to make them more 'boxy' 
along their major axis, as well as correcting the 
light profile to make it more exponential far from 
the center, where p\ diverges from an exponential 
profile. For galaxies with negative 07, it may con- 
tribute to the creation of a low surface brightness 
disk viewed edge-on, such as would be seen in an 
SO or Sa galaxy. 

The ninth principal component does not signif- 
icantly separate Hubble types. The contribution 
from pg is small, and the values of ag are sym- 
metric about zero. The morphological contribu- 
tion from pg is to make the central isophotes more 
'boxy' and tilted at a ±45 degree angle from the 
major axis, where the tilt is dependent on the sign 
of ag. 

6. MIXTURE OF GAUSSIANS MODEL 
AND CLASSIFICATION 

6.1. The Statistical Model 

Now that we have a 9-dimensional representa- 
tion of the data, we wish to have a mechanism that 
identifies clusters in the data set. Each of these 
clusters then corresponds to a particular morpho- 
logical class. We use a mixture of Gaussians model 
to estimate the galaxy density in shapelet space, 
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and to form the basis of our classification system. 
To implement the mixture model method, we use 
the publicly available fastmix software (Moore 
1999), which employs multi-resolutional KD-trees 
to overcome many computational issues associated 
with mixture models. Here, we outline the impor- 
tant aspects of the method found in Connolly et 
al. (2000). The basic idea behind the mixture of 
Gaussians model is to regard each (i-dimensional 
data vector Xi as being drawn from a probabil- 
ity density F. Because the integral of F over the 
region occupied by the Xi is unity, i.e., the prob- 
ability of drawing each Xi over the entire range 
of the data is one, the probability density F is 
the normalized number density for the data. The 
distribution is modeled as 



F{x-e k ) 



3 = 1 



(19) 



where $ denotes a d-dimensional Gaussian of 
mean \x and covariance S. Each $ is given by 



$(x;M,£) = 



1 



(2 7 r) d / 2 |S| 1 /2 



exp 



1 



(x- /z) T E _1 (a;- fi) 
(20) 



The parameters to be estimated from the data set 
are the number of Gaussians k and 9k = (r, ^, E), 
where r = (r ,.. . ,r fe ), \i = (/j, , . ■ • ,Mfc), and E = 
(So, . . . , Sfe). The values of r are limited such that 
Tj > for all j and J2j=o T o = 1 ■ 

To estimate the 9k the method of maximum 
likelihood is used. Assuming that fc is fixed and 
known, this amounts to finding the 9k that maxi- 
mizes the likelihood function. The likelihood func- 
tion is defined as the probability of the observed 
data Xi as a function of the unknown parameters 
9k- In order to find the 9k that maximizes the 
likelihood function, fastmix employs the expec- 
tation maximization (EM) algorithm. To choose 
k, we opt for using the Bayesian Information Cri- 
terion (BIC, Schwarz (1979)). The BIC criteria 
corresponds to approximately maximizing the pos- 
terior probability of the k th model (in this case 
F{x;6k)), and gives preference to simpler mod- 
els (Hastie, Tibshirani, & Friedman 2001), e.g., a 
mixture of Gaussians model with fewer Gaussians. 

6.2. Classification Using the Mixture Model 

The mixture of Gaussians model provides a nat- 
ural basis for the use of morphological galaxy clas- 



sification. We expect that a galaxy's morphology 
is the result of its evolution, and galaxies of similar 
morphology, i.e., in the same morphological class, 
have undergone similar events in their evolution to 
bring about their morphology (e.g., mergers, col- 
lisions, spiral density waves, etc.). We also expect 
galaxies to have histories which are the result of 
physical processes unique to each galaxy, resulting 
in small variations about the 'mean' morphology 
of a class. We assume that these variations may 
be taken as random when looking over a large 
enough sample of galaxies. This results in the 
morphologies of galaxies in the same class being 
normally distributed about the average morphol- 
ogy. If the morphology of galaxies is normally dis- 
tributed about the mean shape, then the shapelet 
coefficients / n are also normally distributed about 
the mean coefficient, (/ n ). A mixture of Gaussians 
model then provides the most natural basis for 
classification, where galaxies of the same Gaussian 
are of the same class. Galaxies may be assigned 
a probability for each Gaussian, allowing one to 
give a measure of how likely it is that the galaxy 
'is in each class. This continuous, quantitative de- 
scription of classification is an advantage over the 
Hubble scheme, as it allows significant flexibility 
Rather than using discrete classes which do not 
allow for ambiguities, we are able to give a mea- 
sure of exactly how confident we are that a given 
galaxy is in a given class, and ambiguous galax- 
ies are not 'pigeon-holed' into classes where they 
do not fit. In addition, it should be noted that 
because we are interested in classification, rather 
than perfectly fitting the dataset, the BIC criteria 
allows for the most optimal fit while keeping the 
number of Gaussians (classes) reasonable. 

We use the 9-dimcnsional space spanned by the 
principal components {pj,j = 1,...,9} as the 
morphological space in which to work. To esti- 
mate the density distribution we used the coeffi- 
cients of the principal components for Sample 2 
and fed them into fastmix. Because most of the 
pj contain axis ratio information, and what we 
see in reality is a 2-dimensional projection of the 
galaxy, where the galaxy is tilted by an angle from 
our line of sight, we do not expect the observed 
galaxy morphologies to be perfectly normally dis- 
tributed. This tilt distorts the galaxy morpholo- 
gies, resulting in the shapelet coefficients diverg- 
ing from a normal distribution. This results in the 
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need for extra Gaussians to fit the density, which 
may create spurious class where the only differ- 
ence is that of axis ratio. Fortunately, it appears 
that this effect is small and differences in axis ra- 
tio do not distort the shapelet coefficients enough 
to require extra Gaussians except for the case of 
spirals viewed edge-on or nearly edge-on. 

After applying the mixture model estimation, 
we find that the 9-dimensional density distribu- 
tion is well fit with seven Gaussians. Using this 
distribution, we are able to classify galaxies by as- 
signing galaxies to the class (Gaussian) for which 
they have the highest probability, as well as as- 
signing an associated probability of being in each 
class. We compare the mixture class with Hub- 
ble type in Figure 11. In Figure 12 we show the 
locations of the galaxies for each mixture class in 
the 2-dimensional slice spanned by p\ and P2 , and 
in Figure 13 we show the mean projection of the 
galaxies along the principal componetns. Anal- 
ysis of the histograms in Figure 11 reveals that 
the mixture model classifications do an excellent 
job of separating the galaxies based on their mor- 
phology, and we can certainly see from Figures 
12 and 13 that the projections along the prin- 
cipal components for the largest mixture model 
classes correspond well with the Hubble types in 
Figures 6 and 7. This is encouraging, as it means 
that galaxies with commonly accepted morpholog- 
ical differences, i.e., of different Hubble type, form 
distinct classes in shapelet space. The classes are 
found from an analytical model that does not ad- 
mit any morphological assumptions (i.e., is model- 
independent), with the exception that the density 
distribution of galaxies in shapelet space can be 
well represented by a mixture of Gaussians. 

Because each galaxy has a probability of being 
in each class, we may calculate mean values of any 
parameter z (e.g., luminosity) for each class from 
the equation 



( z k) 



W ik Zi 



(21) 



where Wik is the probability that the i galaxy is 
in the k th mixture model class, Zi is the value of 
the parameter for the i th galaxy, and N ga i s is the 
total number of galaxies in the distribution. We 
use the normalization — 1, i.e, each galaxy 

has a probability of one of being in a class. This as- 
sumption may not necessarily be correct, and the 




Fig. 11. — Plots showing the results of the mixture 
model classification. The first histogram (top left) 
shows the distribution of mixture classes for the 
galaxies of Sample 2. The "Reduced T-type" seen 
in the following histograms are based on the Hub- 
ble classifications of Sample 1 and stands for the 
following: l=Early, 2=Middle, 3=Late, 4=Edge- 
on. Note that the x-axis is different for the first 
histogram because it shows the distribution of 
mixture classes, whereas the others show the dis- 
tribution of broad Hubble type in each Mk- The 
T-type values of the last plot are also from the 
Sample 1 data, and are as follows: 0=E, 1=S0, 
2=Sa, 3=Sb, 4=Sc, 5=Sdm, 6=Im. As can be 
seen, the Hubble types map very well to several of 
the mixture classes. The larger error bars seen in 
the bottom plots show the standard deviation in 
the distributions, and the shorter ones show the 
error in the mean. 
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Fig. 12. — Locations of the galaxies of Sample 2 
in the 2-dimensional slice spanned by pi and p2- 
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Fig. 13. — Projections along the principal com- 
ponents for the galaxies of Sample 2, classified by 
mixture model class Mk- The larger error bars 
show the standard deviation in the distributions, 
and the shorter ones show the error in the mean. 



addition of a uniform background distribution to 
the mixture model would likely alleviate this prob- 
lem; however we did not do this in this analysis 
and it does not appear that there are any outlier 
morphologies that would necessitate the addition 
of a uniform distribution. The variance of the pa- 
rameter z may be calculated in a manner similar 
to the mean. This is an advantage over classifica- 
tion systems that use discrete classes, such as the 
Hubble system, as it allows a quantitative com- 
parison between morphological classes and aver- 
age physical parameters. We show in Figure 14 
the images constructed from the mean shapelet 
coefficients and scales in each class, and in Figure 
15 we show the mean values of several physical 
parameters as a function of mixture model class. 
In addition, we also show a galaxy from each class 
before and after applying the procedure of § 4 in 
Figure 16. Hereafter, we will denote the k th mix- 
ture model class as Mk- 

6.3. Description of the Mixture Model 
Classes 

Mixture class is the main elliptical class 
with a probability t-2 k 0.266 (i.e., Mi is expected 
to contain ~ 27% of galaxies, see § 6.1), and M 6 
a minor elliptical class with tq « 0.045. The asso- 
ciation of these classes with early types is evident 
from the comparison with Hubble types seen in 
Figure 11 and the mean morphologies seen in Fig- 
ure 14. The mean Hubble type for M2 and Mq 
is SO. The physical properties of M2 and Mq are 
consistent with early types as well; these galax- 
ies, on average, have higher velocity dispersions, 
are the reddest, have the smallest scale 7, are the 
most concentrated, and have high surface bright- 
ness. Their profiles (i.e., Sersic index) are also 
consistent with early types, as are their spectral 
eigencoefficients. It should be noted that there is a 
bias in the computed Sersic index towards smaller 
n (Blanton, private communication). The effect 
is small for n ~ 1, but becomes larger for n ~ 3, 
such that a value of n ~ 3 is more like an actual 
Sersic index of ~ 4. The only difference between 
M2 and Mq is morphological; they have distinctly 
different coefficients of p$ and pg. This results in 
the noticeable twist of the inner isophotes seen in 
the mean morphology of Mg. 

Mixture class M3 corresponds to typical late 
type spirals and has T3 ps 0.340. The mean mor- 
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phology of M3 certainly is what we would expect 
for late spirals, and the mean Hubble type is Sc. 
In addition, the physical properties of the galaxies 
of M3 are consistent with late spirals. Galaxies of 
M 3 have low velocity dispersions, are blue, large, 
the least concentrated, have exponential profiles, 
and low surface brightness. The spectra of M3 
galaxies are also consistent with late types. 

The first mixture class (ti « 0.111) contains 
late type spiral galaxies that are viewed edge-on, 
as is obvious from Figures 11 and 14. These galax- 
ies display many of the same characteristics as 
those spirals viewed face-on (Ms), but are gener- 
ally dimmer, redder, more concentrated, and differ 
in their spectra. These differences are, of course, a 
result of the viewing angle. Galaxies viewed edge- 
on have a higher column density than those seen 
face-on. This results in an increase in internal 
extinction for the galaxies of M3, making them 
dimmer and redder (Elmcgreen 1998). Also, this 
increase in extinction reduces the contribution to 
the spectrum from the older bulge stars (Misiri- 
otis, Popcscu, Tuffs, & Kylafis 2001). We have 
compared the spectral eigencoefficients, ej, for Mi 
and M3 and find that, on average, the two classes 
exhibit different ej, with the most noticeable dif- 
ference in 63. 

Mixture class M 5 appears to be dominated by 
early type spirals and ellipticals, with and aver- 
age Hubble type of Sa and T5 0.201. The pro- 
jections along the principal components typically 
show values between the elliptical classes and the 
spiral classes. In addition, with the exception of 
Sersic Index, galaxies of M5 typically have physi- 
cal properties similar to those of M2 and M§. 

The fourth mixture class does not appear to 
be dominated by any particular Hubble type, has 
T4 « 0.024, and appears to contain a nearly equal 
amount of spirals and ellipticals. Based on the 
construction of the mean morphology, the average 
a,j , and the value for the Sersic index, is seems that 
M4 has a light profile that falls off steeply near 
the center and then levels out into an extended 
halo. The flux of these galaxies has a tendency 
to be slightly asymmetric near the center across 
the minor axis, as seen by comparing 04 and a 5 . 
Galaxies of M4 typically have the largest statis- 
tical spread in their physical and morphological 
properties and show physical properties character- 
istic of early types. In addition, M 4 is made up 



of the smallest and closest galaxies, with an aver- 
age redshift of z ~ 0.04, compared to z ~ 0.055 
for the other mixture classes. Spirals in M4 are 
typically small and show a central bulge that is of 
significantly higher surface brightness and smaller 
size than the disk. 

The last mixture model class, M7, is probably 
the most interesting. It is typically made up of 
late type spirals, both seen edge-on and face-on, 
with an average Hubble type of Sc and T7 s=s 0.013. 
The values of 03, 0,7, and a 8 are noticeably different 
from normal late type spirals (i.e., M3), and we in- 
terpret their contribution as resulting in a lack of 
separation between bulge and disk components, or 
in a bulge with a significantly larger scale length. 
In addition, galaxies of M-j appears to typically 
be slightly asymmetric about the minor axis, and 
are often asymmetric across the major axis, as evi- 
denced by the large spread in a%. Mixture class M7 
probably contains a significant fraction of morpho- 
logically irregular galaxies. Galaxies of Mj have 
the highest velocity dispersions and spectral eigen- 
class, and are the dimmest and bluest. They are 
of moderate size, and have central concentrations, 
Sersic indices, and surface brightnesses similar to 
M 3 . In addition, M7 is the only galaxy class with 
typically negative second and third spectral eigen- 
coefficients, meaning that, on average, they dis- 
play post-starburst activity in their spectra (Yip 
et al. 2003). 

7. CONCLUSIONS AND FUTURE WORK 

One of the major projects of modern extra- 
galactic astronomy is the pursuit of a quantitative 
and automatic morphological galaxy classification 
scheme. The Hubble sequence is becoming more 
inadequate, and new systems are needed. A quan- 
titative description of morphology will allow as- 
tronomers to give quantitative relations between a 
galaxy's morphology and its physical parameters. 
In addition, automating the classification scheme 
will allow the use of large astronomical databases 
for the analysis of galactic morphology, which will 
be of great benefit in investigating the physical 
significance of a galaxy's shape. 

In this paper, we have tested the shapelet de- 
composition method as a quantitative and auto- 
matic description of galaxy morphology. We ap- 
ply the method to a sample of ~ 3000 galaxies 
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from the Sloan Digital Sky Survey and show that 
galaxies of known Hubble type separate cleanly in 
shapelet space. In addition, using the vast amount 
of SDSS data allows the admission of powerful sta- 
tistical methods of analysis, such as principal com- 
ponent transforms and the mixture of Gaussians 
model. Applying the principal component analy- 
sis, we find that the morphological differences of 
nearby galaxies can be sufficiently described using 
nine principal components, allowing us to work 
within a 9-dimensional space spanned by these 
components. We show that each principal compo- 
nent contains unique morphological information, 
and sufficiently separates galaxies that are known 
to have different morphologies. 

Furthermore, we apply a mixture of Gaussians 
model to describe the density of galaxies in the 
shapelet space spanned by the principal compo- 
nents, where each Gaussian represents a morpho- 
logical class. The mixture model fits the distri- 
bution well to seven Gaussians, implying seven 
classes. The five 'main' classes were shown to 
correlate well with a broad description of Hubble 
type, whereas the other two classes were populated 
by galaxies that were often outliers in shapelet 
space. Our method is model- independent, purely 
objective, and automatic; the fact that the method 
is able to separate galaxies of different morpholo- 
gies so well is promising and attests to its efficacy. 

We show that galaxies of different morpholo- 
gies differ, on average, in their physical proper- 
ties. Much effort has been expended exploring the 
connection between the morphological appearance 
of a galaxy and its fundamental physical proper- 
ties, such as mass, luminosity, and stellar pop- 
ulation. There are such connections, and they 
have been detected through manual classification 
of galaxies. Most noteable of these is the corre- 
lation between color and Hubble type (Roberts 
& Haynes 1994). This paper shows that entirely 
automated classification schemes can also reveal 
these trends. It also confirms that connections 
between morphology and fundamental parameters 
are loose. Tighter correlations have been observed 
when comparing manual Hubble classifications to 
color; however, because of differences in resolu- 
tion between the sample used in this paper (2.0 
kpc) and that used in previous work with Hubble 
classifications, it is difficult to draw a conclusion 
as to whether the mixture classes have a closer 



connection to the spectral properties of galaxies 
than Hubble types. Indeed, it may well be that 
the overall morphology of a galaxy is more closely 
related to accidents of its history than to its fun- 
damental parameters. 

The method may be further refined by using 
data across several wavebands. We are using SDSS 
data to decompose the galaxy images in the u, g, 
r, i, and z bands, and applying the principal com- 
ponent analysis to the coefficients in all of these 
bands (Kelly & McKay, in preparation) . This will 
allow us to take into account information of the 
morphological differences in observing wavelength, 
and to construct principal components and coef- 
ficient arrays that contain spectral information as 
well. This will likely serve to make the separa- 
tion in shapelet space more apparent and may in- 
troduce new and interesting morphological classes. 
Traditional classification is done on only one ob- 
serving band, and using information on how a 
galaxy's shape varies across bands may introduce 
new information and classifications. Also, in an 
attempt to make the coefficient distributions more 
normally distributed (see § 6.2), and thus increas- 
ing the effectiveness of the mixture of Gaussians 
model, we are testing the use of shapelets of el- 
lipticity equal to that of the galaxy. This may 
take away the axis ratio dependence of the coeffi- 
cients, which was shown to distort the coefficient 
distributions such that the coefficients were not 
normally distributed in the direction of axis ra- 
tio information. It would be interesting to apply 
the shapelet method to high redshift galaxies and 
compare results to this local sample. As we only 
apply the method to galaxies in the nearby uni- 
verse (z < 0.07), these results are not necessarily 
applicable for higher redshift galaxies and further 
analysis of such galaxies is needed. 
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Fig. 14. — The mean morphologies of the mix- 
ture model classes, reconstructed from the mean 
shapelet coefficients for each respective class. The 
contours denote where the image drops to e~ J of 
its maximum value, where j is the contour num- 
ber from the center. For example, the first contour 
from the center labels where the flux has dropped 
to e" 1 of its maximum value. 
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Fig. 15. — The average of several physical proper- 
ties for each Mixture class. The larger error bars 
show the standard deviation in the distributions, 
and the shorter ones show the error in the mean. 
Sersic Indices are from Blanton et al. (2003a). 
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Fig. 16. — Galaxies from each Mfe, shown be- 
fore artificial redshifting and smoothing to a PSF 
width of fa (top) and after (bottom). 



